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Message 

Examiner Schnizer, 

In preparation for today's telephone interview at 2PM (Eastern) regarding U.S. 10/709,739, please 
find attached the following: 

(1) proposed claim amendments 

(2) Murphy et aL, which shows that miR 112-1 suppresses expression of the gene encoding 
immediate-early (IE) protein 1, which corroborates the disclosure at Lines 6,493,606-6,493,610 of Tabic 7 
disclosing the regulation of NC_00 1347 (i.e., IRS I gene) by GAM501831. 

Best regards, 

Teddy Scott Jr., Ph.D. 53,573 
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ALTERNATIVE 1: PROPOSED AMENDMENTS TO THE CLAIMS FOR U.S. PATENT APPL. 

NO- 10/709,739 

1. - 25 (Canceled) 

26. (Previously Presented) An isolated nucleic acid, wherein the sequence of the nucleic acid 
consists of: 

(a) SEQ ID NO: 4204050; 

(b) a DN A encoding the nucleic acid of (a), wherein the DNA is identical in length 
to (a); or 

(c) the complement of (a) or (b), wherein the complement is identical in length to the 
nucleic acid of (a) or (b). 

27. - 30 (Canceled) 

31 . (Currently Amended) A vector comprising a heterologous sequen ce, wherein the 
heterologous sequence consists of the sequence * i * aURa e rt T wh e r e in th e viral ins e rt consists of the nucleic 
acid of claim 26 , - and wh e rein th e v e ctor do e s not comprise any viral insert otho^than - lh^niiol^o aoid of 
claim 26 . 

32. (Canceled) 

33. (Currendy Amended) A probe comprising a heterologous sequence, wherein the heterologous 
sequence consists of the sequence, w at ino e rtz - wher e in th e viral insert consists of the nucleic acid of claim 
26 „ and wh e rem - th e ^rob e"( ^ » ^^oit^ri^ any viral insert oth e r than the nucleic acid of o1aim -3& 

34. (Canceled) 

35. (Previously Presented) An isolated nucleic acid, wherein the sequence of the nucleic acid 
consists of : 

(a) SEQ ID NO: 117937; 

(b) a DNA encoding the nucleic acid of (a), wherein the DNA is identical in length 
to (a); or 

(c) the complement of (a) or (b), wherein the complement is identical in length to the 
nucleic acid of (a) or (b). 

36. (Currently Amended) A vector comprising a heterologous sequence, wherein the 
heterologous sequence consists of the sequence viml - inaert. wh e r e in th e viral insert oonsists of the nucleic 
acid of claim 3 5, and wherein the veotor do e s not - oomprise any viral insert oth e r than the nucl e ic acid of 
Glaim-35. 



-2- 
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37. (Currently Amended) A probe comprising a heterologou s sequence, wherein the heterologous 
sequencecon^ists of the sequence v iral ins e rt* Wh e r e in th e viral ' ms e rt - eenw gte-of the nucleic acid of claim 
35 , and wh e reiiKh e- probo dooo - not - oompriQ e any virQl ins e rt oth e r than th e nucl e ic aoid - ofa}akT>£5 . 

38. - 40. (Canceled) 



-3- 



PA6E 3/1 1 * RCVD AT 2/2612009 1 1:12:08 AM [Eastern Standard Time] * SVR:USPTO-EFXRF-6/20 * DNIS:2738300 * CSID: * DURATION (mm-ss):03-38 



FEB J 26-2009 10:08 



Polsi ne 1 1 i 



P. 004 




Docket No. 050992.0302.00USCP 



ALTERNATIVE 2: PROPOSED AMENDMENTS TO THE CLAIMS FOR U*S. PAT. APPL- NO. 

10/709,739 

1. - 25 (Canceled) 

26. (Previously Presented) An isolated nucleic acid, wherein the sequence of the nucleic acid 
consists of: 

(a) SEQ ID NO: 4204050; 

(b) a DNA encoding the nucleic acid of (a), wherein the DNA is identical in length 
to (a); or 

(c) the complement of (a) or (b), wherein the complement is identical in length to the 
nucleic acid of (a) or (b). 

27. - 30 (Canceled) 

3 1 . (Currently Amended) A vector comprising a heterologous sequence of a nucleic acid» 
wherein the heterologous sequence of the nucleic acid consists of: 

U) SEP IP NO: 4204050 

(b) a DNA encoding the nucleic acid of fa), wherein the DNA is identical in length, 
toM:or 

(q) the complement of fa) or (b\ wherein the complement is identical in length to the 

naclgicacid of fa),or_(b). 
vim! i ns e rt wh e r e i n th e - v i r al infi e rt - oo^is^> £ ^^ - of : and - wh e r e in 

t he - v e ctor-do e c .- not - oomorir .e- any viral-ins e rt oth e r than th e ntiol e io aoid of olQim - 26. 

32. (Canceled) 

33. (Currently Amended) A probe comprising a heterologous sequence of a nucleic acid, wherein 
thcJbg^crologous.sequence of the nucleic acid consists of: 

(a) SEP ID NO: 4204050 

(b) a DNA encodin g the nucleic acid of (a), wherein the D NA is identical in length 
to (ft): or 

(c) the complement of ftp or fbV wherein the complement is identical in length to the 
nucleic acid of (&) or (b\ 

orobo - mt^yt. - whoTOinlh e p robe in s^-qgH ^fr H pj^m of t -aftd 

whoroin - thc p robe d oa n not oomDrioo - ariv - vira4 - infiort - othor thon th e- nucle i c acid of olaim 
gfe 

34. (Canceled) 
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35. (Previously Presented) An isolated nucleic acid, wherein the sequence of the nucleic acid 
consists of : 

(a) SEQ (O NO: 117937; 

(b) a DNA encoding the nucleic acid of (a)> wherein the DNA is identical in length 
to (a); or 

(o) the complement of (a) or (b) t wherein the complement is identical in length to the 
nucleic acid of (a) or (b). 

36. (Currently Amended) A vector comprising a b_etcr_o_1ojL>ous fteQuencejaJT.a_nuclejc.acid, 
wherein the hclcip,l.o^ouft_spQuence oTjjheaiaeJ^ 

(a) §mmmLinm 

(b) a DNA encoding the nucleic acid of (a), wherein the DNA is identical in length 
to (a) ; or 

(c) the complement of (a) or (b\ wherein the complement is identical in length to the 
nucleic acidof (al_or_(bl, 

^v4r al insert wherein the viral ins e rt consists of a nucl e i o aoid consisting of . and wh e rein 
th ^ oo t^ dootM ^ ot ' Qomprii^e - flnv viral jbgwrt ^_t h efrjfe 35 t 

37. (Currently Amended) A probe comprising a heterologous sequence of a nucleic acid, wherein 
the heterologous sequence of the nucleic acid consists of: 

(a^ SEP ID NO: 117937 

a DNA_en_coding_me_nucleic_acid_of fa), wherein the DNA is identical in length 
to fr); or 

f c) the complement of (a) or fb\ wherein the complement is identical in length to the 

nucleic acid of (a) or (b). 
H3K>b< H^ei^*h^ of » and 

wh e r e in th e v e ctor do e s not opmoris e any viral ins e rt oth e r - than 4h e nuo1 ^ icKi04d '4 i>^ ^w> 

38. - 40. (Canceled) 
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Suppression of immediate-early viral gene expression 
by herpesvir us-coded microRNAs: Implications 
for latency 

Eain Murphy*, Jlri Vanfcek™, Harlan Robins 5 , Thomas Shenk*, and Arnold J. Levlne™ 

'Simon* Center for System* Biology, Institute tor Advdnwd Study. Princeton. NJ 085*0; -Department of Molecular Biology, Princeton University, 
Princeton, NJ 06544; 'Laboratory of Theoretical Physical Chemistry, Ecote PplytechniqueFctidrdle dc Uusanne. Lausanne CH-1015, SwJtterland; 
*nd *f red Hutchinson Cancer Research Center. Seattle, WA $$1 09 

Contributed by Arnold J, Levine. December 1 9, 2007 fcent for review November 20, 2007) 



A quantitative algorithm was developed and applied to predict 
target genes of mtooRNAs encoded by herpesviruses. Although 
there is almost no conservation among microRNAs of different 
herpesvirus 5 ubf ami lies, a common pattern of regulation emerged. 
The algorithm predicts that herpes simplex virus 1. human cyto- 
megalovirus, epstein-Barr virus, and iCapost's sarcoma-associated 
herpesvirus all employ microRNAs to suppress expression of their 
own genes, inducing their Immediate-early genes. In the case of 
human cytomegalovirus, a vims-coded microRNA, miR-1 12-1, was 
predicted to target the viral immediate-early protein 1 mRNA. To 
test this prediction, mutant viruses were generated that were 
unable to express the mfcroRNA, or encoded an immediate-early 1 
mRNA lacking its target site, Analysis of RNA and protein within 
infected cells demonstrated that miR-ULin2-1 inhibits expression 
of the major immediate-early protein. We propose that herpes- 
viruses use microRNA-mediated suppression of immediate-early 
genes as part of their strategy to enter and maintain latency. 

mi RNA* | reactivation | immune evasion 

MicroRNAs (miRNAs) arc 20-23-nucleoticle RNA molecules 
that bind to mRNA targets, generally within their 3 ' untrans- 
lated region (3' UTR), and interfere with their translation (1» 2). 
Viruses have co-evolved with cellular miRNAs and many encode 
their own miRNAs (3), Every herpesvirus genome that has been 
oamiried has been found to encode multiple miRNAs, including 
Epstein-Barr virus (EBV) (4-6), human cytomegalovirus 
(HCMV) (4, 7, 8\ herpes simplex virus 1 (HSV-l) (9, 10), and 
Kaposi's sarcoma-associated herpesvirus (KSHV) (4, 6, 11, 12). 
These miRNAs can potentially function during lytic replication and 
latency. Lytic replication proceeds in a coordinated three-phase 
cascade: immediate-early {IE), early and late. IE products prepare 
the cell for infection and propagate the cascade. Early gene 
products support replication of viral DNA, and DNA reputation is, 
in turn, a prerequisite for full activation of the late genes that 
encode the structural proteins of the virus. During latency, the virus 
is quiescent. A limited subset of the viral genome is expressed, but, 
importantly, the virus has the potential to reactivate and reenter the 
lytic cycle. Although the molecular mechanisms of reactivation arc 
not understood, it is widely assumed that the lytic cascade is 
reinitiated with the expression Of IE genes. Ectopic expression of a 
single IE protein has been shown to reactivate HSV-l (13, 14), EBV 
(15), or KSHV (16, 17) in cell culture models of latency. 

Whereas none or only a few protein-coding genes arc ex- 
pressed, multiple miRNAs are transcribed during latency. The 
HSV-l miR-CAT lies within one of the latency associated 
transcripts (lATs), the only viral RNAs known to' be expressed 
in latency, EBV and KSHV miRNAs Also are expressed during 
latent infection. Because they arc noiinnmtiDOgemc, miRNAs 
should be optimal agents lor suppression Of anti-viral responses 
and to modify behaviors of latently infected cells, and recent 
reports suggest this is the case. The HSV-l miR-LAT has been 
reported to inhibit apoptosis in neurons by inhibiting translation 

wvw.pnas.org/cgi/doi/ 10. l073/pnas.07l^1 0105 



of TGF-0 and SMAD3 (10); the HCMV miR-ULH2-1 represses 
expression of MICB, a ligand of the natural killer cell activating 
receptor NKG2D (18); and KSHV miRNAs down-regulate 
thrombospondin 1 expression, a regulator of cell adhesion, 
migration and angiogencsis (19). 

Virus-coded miRNAs also could act to antagonize the expression 
Ol* IE genes. This could contribute to down-regulation of IE 
expression during later phases or the lytic replication cycle when 
miRNAs have accumulated to relatively high levels, and it could 
help to establish latency and subsequently antagonize reactivation 
of latent virus, given the critical role of IE gene products in the 
initiation of the lytic cascade. Motivated by this reasoning, we 
developed an nuTOJA-target-predicting algorithm and applied it to 
all 3' UTRs of four herpesviruses. The high probability targets 
predicted by this algorithm included IE transactivaiors for all four 
viruses.- 1CP0 of HSV-l; BZLF1 and BRLFl of EBV; IE1 of 
HCMV; Rta and Zta of KSHV, We experimentally confirmed the 
ability of an HCMV-codcd miRNA to reduce 1E1 expression 
Because these genes are necessary for efficient initiation Of the lytic 
program, we propose that herpesviruses use a common rniRNA- 
bascd Strategy for maintaining latency. 

Results 

For further details, please see supporting information (SI) Materials 
and Methods and vSI Tables 3-8. 

Predicting Targets of HCMV-Coded miRNAs Within the HCMV Genome. 

To test our hypothesis that herpesvirus miRNAs might inhibit 
expression of viral genes needed for efficient lytic replication and 
thereby favor latency, wc asked whether vital miRNAs had poten- 
tial to target viral 3' UTRs. Instead of listing all conserved potential 
miRNA binding sites or computing scores based on various em- 
pirical rules, our algorithm uses a combination Of analytical expres- 
sions and Monte Carlo simulations to determine exact probabilities 
that predicted miRNA targets would occur by chance. Wc use the 
standard assumption that the 3' UTR sequence has coevolved with 
the sequence of the miRNA (20; 21) iind the experimental obser- 
vation that miRNA binding requires a perfect complementarity of 
a "Seed" sequence near the 5' end Of the miRNA to a sequence in 
the y UTR (2). This seed is usually a heptamer at positions 2-8 
from the 5' end of the miRNA. As a result of ccevolution, the 
number of actual seed oligomers present in the 3' UTft of a 
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Table 1. Top 10 predicted miRNA-tar^et pairs in HCMV when sorted by PVsh score 



3 'UTR 




hcmv^miR 


Act' 




First-order local MM 




HCMVORF 


Length* 


Exp.* 


Log i6 PVsh 


PVmh 


UL103 


21 


USS-2 




0.000 


-4.11 


0.036 




67 


UL54-1 




0.001 


-2,96 


0.155' 


RL10 


57 


US5-1 




0.003 


-2.52 


0.273 


UL51 


62 


UL112-1 




0.003 


-2,46 


0.229 


UL80 


34 


UL70-5P 




0.004 


-2.42 


0.187 


U134 


14 


UU12-1 




0.004 


-2.41 


0.155 


UL3 


57 


UL70-5p 




0.005 


-2.33 


0.155 


UL69 


253 


U S3 3-1 




0.005 


-2.29 


0.144 


UL57 




US2S-2-5p 


2 


0.108 


-2.27 


0.127 


UL1S3<IE1> 


92 


Ut112-n 


1 


0.006 


-2.21 


0.125 



The table shows the top 10 Of 4,B96 possible miRNA-3' l/TR pair* for the hCmV genome. The statistical 
significance of the toptorgets 1$ measured by the multiplo hypothesis lvalue PV W . The random background used 
Is the first-order Joca I MM. 161 (UL 123) is shown in bold. 
♦Length denotes the total number of all conserved heptamers in the 3'LTfR. 

r AcL donotes the actual count (in the 3' UTR) of conserved heptamers complementary to the mIRNA seed. 
*£Xp. denotes the count expected In the random sequence. 



targeted gene will be higher than the number thai would appear by 
chance in a random sequence with similar composition. The 
algorithm predicts functional miRNA targets in two steps. 

First, for each miRNA-3' UTR pair, we compute an approximate 
probability PVsh (Pvalue for single hypothesis testing) that ttwould 
appear by chance in the random sequence; the smaller PV S h is, the 
more Jikely the given pair is to be biologically functional. (Proba- 
bility FV$h is very nearly exact: The only approximation is that we 
assume independence between consecutive oligomers.) This pro- 
cedure alone allows testing whether a given miRNA is likely to 
target a given 3' UTR and is analogous to that used by Robins and 
Press (21). 

Second, if we are interested in finding functional targets of 
multiple miRNAs among multiple 3' UTRs, we need to take into 
account multiple hypothesis testing. We do this by performing a 
Monte Carlo simulation in which we compute the probability PV MH 
(P value for multiple hypothesis testing) that the top, say JO, 
miRNA-target pairs in a randomly generated genome with similar 
properties would have their PVsh lower than the corresponding top 
10 miRNA-target pairs in the real genome. We used this approach 
instead of the now standard False Discovery Rate analysis (FDR) 
of Bcnjammi and Hochberg (22) because of the discrete nature of 
Out data. In our case, most PV$ W values are I, and so FDR analysis 
is not applicable because it requires a fairly uniform distribution of 
PVsn except a small overrepresentation at values close to 0. 

Details and various extensions of the algorithm, such' as the 
generation of the random background sequence based on the 
Markov model (MM), the calculation of Ihe single and multiple 
hypothesis P values, the use of conservation (21) among Strains of 
the virus, and the choice of the seed sequence, arc described in 
Materials ami Methods, 

We applied the algorithm to all 3' UTRs of the HCMV genome, 
and Table 1 shows the 10 most probable miRNA-target pairs of the 
4896 total possible miRNA-3' UTR pairs. For each pair, the table 
shows the score PVst-t and the statistical significance PVmh of all 
predictions up to this one. For instance, the 10th miRNA-target 
prediction, miR-ULH2-l targeting the IE tninsacuVator protein 1 
mRNA (TE1 , encoded by the UL123 ORF, Inghligh' ted), has a score 
PVsh - 10 _Z21 « 0.0062 and FV M n ~ 0.125, meaning only 12,5% 
of randomly generated genomes have top ten P values better or 
equal to PV ai = lO^ 2 * 21 . For top 25 most probable miRNA-target 
pairs, see SI Table SC. In fact, the data set in diat table suggests that 
the most significant predictions are the top 10 listed in Tabte 1 
because there is a sharp increase in PVmh from the 10th to 11th 
prediction: PVmh (10) = 0.125 and PV^ (11) = 0.309. Naturally, 



PVmh (*) increases toward 1 for larger k. In our analysis, wc 
required that a target be conserved in six sequenced strains of 
HCMV. If conservation among strains is not taken into account, 
PVmh suggests that there arc many more significant targets (35 with 
PVmh < 0.20, sec SI Table 5C). Finally, the PVmh values listed in 
Table I are wnscrvatrvc upper bounds because we considered all 
published sequences of detected potential miRNAs although sev- 
eral arc only slight variations of each other and some others arc 
perhaps not real miRNAs. 

HCMV I El Protein Synthesis Is Suppressed by rmR-UI-HZ-I. Inhibition 
of any of the genes in Table 1 could potentially favor latency, but 
we considered IE1 to be a prime target, given its central role at the 
Start of the HCMV transcriptional cascade. IE1 is one of two main 
products of the HCMV major IE locus, the other being IE2. IE1 and 
IE2 are required to execute the virus' transcriptional program (23), 
and they almost certainly influence the choice between latency and 
lytic replication. A mutant virus unable to produce a functional IE1 
protein replicates efficiently only after infection at a high input 
multiplicity (24); at lower multiplicities, it fails to accumulate 
normal levels of early mRNAs (25). Jt activates transcription at least 
in part by controlling histonc modifications (26). 

The algorithm predicted a single binding site for raiR-UL112-l 
within the 99 nucleotide 3' UTR of the IE1 mRNA. To test the 
prediction that miR-UL112-l mhibits translation of IE1 protein, we 
prepared two reporter constructs. Hie first contained the wild-type 
EE1 3' UTR downstream of the lucifcrasc coding region and the 
second contained a derivative of the 3' UTR lacking the 7-micle- 
Otldc seed sequence predicted to be the target of the miRNA (Fig. 
\A t shaded sequence). HEK293T cells were cotransfected with set 
amounts of the reporter plasm ids and increasing amounts of an 
effector plasmid expressing the miR-ULUZ-l precursor hairpin 
sequence. The miRNA induced a statistically significant reduction 
in luciferase expression from the reporter with a wild-type IE1 3' 
UTR (maximum repression - -60%) but not from the modified 
3' UTR lacking the seed sequence (Fig. IB), arguing that miR- 
UL112-1 targets the seed sequence within the IE1 3' UTR to 
reduce translation or degrade the RNA 

We next generated three viruses to test whether miR-OLll2-l 
tturgets DEI expression within an HCMV-infbctcd cell. The first, 
BFX^IElds", lacks the 7-nt seed sequence within the IE1 3' UTR 
that is targeted by the miRNA. The second, BFXw*M12-l*~, is 
unable to express the miRNA The rniR-ULllM precursor is 
encoded on the t>NA strand opposite UL1 14, and disruption of th is 
ORF inhibits virus replication (27, 28). Consequently, we substi- 
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fig. 1. mlR-UU 12-1 Inhibits expression from a ro porter. mRNA containing 
the i£i 3' UTR. (A) The predicted mi fMJ L1 12-1 -binding srte within tho KCMV 
major IE focus. At the top of the diagram, the spiked mRNAsthat encode iSi 
and IE2 are depicted with the nOncoding exon 1 (Ex1> shown as ftn open box 
and the coding exons (Ex2-5) depicted as gray boxes, l£l and IK share Ex2 and 
Cx3J The PolyA sites and the location of the rttiR-ULl 1 2-1 binding site in the 3' 
UTR (gray pinheod) are shown. At the bottom of the diagram, the IE1 3' UTR 
sequence is expanded a nd the putative miR NA/mRN A base pa iri ng is de pitted. 
The gray box denotes nucleotides within the miRNA seed sequence, (fl) 
Reporter assay for miR-UL1 12-t function. 293T cells were cotronsfGCtod wfth 
firefly lucif erase expression plasm ids containing either the wild- type flight 
gray) or mutant IE1 3' UTR (dark gray) as well as a Renilta lucrfcraso internal 
control. Cells were additionally cotransfected with the indicated amounts of 
a miR-ULll2-l expressing plasmid, and transfection mixtures were balanced 
with the. expression plasmld lacking an Insert. Firefly ludferase units were 
normalized to RenllJa lucif erase. The iu erf erase units are shown relative to the 
amount of lucif erase from the reporter construct in the absence of miRNA 
expression plasmids. Asterisks denote P values <0.05 as determined by the 
Student's t test 



tutcd 12 nucleotides within the miR-TJLll2-L pre cursor sequence 
while rnamtaining the coding sequence of the UL114 ORF (Fig. 
24). The m£R-UL112-l mutation was repaired in the final vims 
BFXsuAl 12-lr; to control for potential ofT-target mutations. The 
viruses grew normally in fibroblasts (Fig. 25). Wc also monitored 
accumulation of miR-UL112-l by quantitative ftT-PCR. The 
miRNA accumulated to a detectable level between 8-32 h after 
infection with wild-type virus and then increased as the infection 
progressed (Fig. 2C). No mifc-TJL112*l was detected at 48 h after 
infection with BFXswfrll2-l'" > a time at which the miRNA was 
readily detected in cells infected with the other viruses (Fig. 2D). 

To determine whether FBI protein levels were affected by the 
expression of raiR~UL112-l> we prepared extracts from infected 
cells after a 1-h ^S-labeling period at 6, 24 and 48 hpi with wild-type 
or mutant viruses. Wc did not monitor celts later than AH hpi. even 
though the miRNA accumulated to higher levels at 72 hpi (Fig. %A\ 
because infected cells show severe cytopathic effect at the later 
time. We first examined the steady state levels of several proteins 
by Western blot assay (Fig. 3A Top). Tubulin levels, which arc not 
altered by infection, provided a precise measure of the amount of 
cellular protein analyzed in each sample; and the accumulation of 
the late HCMV protein, pp2S\ confirmed that alt infections pro- 
gressed normally. Wc monitored IE1 steady State levels, but litde 
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Fifl. 2. BFX*u6UL1 1 2-1 " and BPXd/)£ic*~ grow wfth wiJd-type kinetics. (A) 
Tho nucleotides mutated In the SFXtt/ouuiM" (112-1-) relative to BFXvvt 
(WT) arc shown with g ray lines and the complement of the mature miRNA is 
shaded within the WT sequence- The substituted nucleotides in 1T2- 
1 -conserves the amino add sequence (shown above and below nucleotide 
sequences) of tho UL114 gene. (S) mrc-5 fibroblasts were infected at a 
multiplicity Of 0.1 pfu/ceUwithWT,1l20- / 3FXsu^n2or(ii2-ir,arevertant 
of ii2-t-)orBFXdAEicis- CiEttis", lacks the 7-nudect id e seed sequence in iei 
mRNA). Virus 0U r infectious units) was assayed at the indicated times. CO 
qflT-PCR wtth a mlR-uLll2-t specific probe was used to detect the accumu- 
lation of the miRNA in WT-irrfected MRC5 fibroblasts overtime. miR-UL1 12-1 
levefe w«n» normalized to tha axpreaton of a cetlul&r small nucleolar RIMA. 
hsntie. M, mock infected. (D) Accumulation of miR*112*1 was assayed by 
qRT-PCR at 48 hpi with the indicated viruses and normalized to hsn$8 expres- 
sion, Tho 112-1- virus had rruRMA level* similar to mock infected cells. 



difference was evident after infection with wild-type and mutant 
viruses. This was not surprising, because IEI protein has a >20-h 
half life, and it accumulates to a high level before the miRNA is 
available. 

Next, IEI was mimunc^iecipitaled from extracts and subjected 
to electrophoresis to identify protein synthesized during each 1-h 
labeling period (Fig. 2A Bottom). The rate of DEI synthesis was 
substantially greater at 6 hpi than at later times for all viruses, and 
this is expected because the promoter responsible for the produc- 
tion of IEI mRNA is repressed late alter infection (29, 30). 
Radioactivity in the IEUspecific band was quantified relative to the 
level of tubulin, and Fig. 3u5 Top presents the results of two 
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Fig. 3. VlruM* that lack miR«UL1 1 or to binding $rt<? synthesize more IE1 
protein. (A) MRC5 fibroblasts were mock-Infected Cm) or infected with BFXvvt 
(WT), BFXMfcl 12-1" (i 1M-), BFX«/o1 12-1r (1 12-1 r). or BFXtf/IEtcts- (lEIds"). 
Cdb were 5S S-tebeled for 1 h before harvesting at the indicated times after 
infection, lysateswere prepared and analyzed by Western bloffor IE1, the late . 
virus-coded pp28 or tubulin (Upper) ot immurtOpfecipitetion followed by 
electrophoresis for 35 S-labeled lEt (Lower). The experiment shown is a repre- 
sentative of $ independent immunopredpitotion^ O Top) Quantification of 
labeled l£1 relative to tubulin. IE1 protein levels were Quantified by 
phosphorirnager onafysb of imrriunoprccipeted complexes from two in depen- 
dent experiments, each of which was analyzed by three Independent immy- 
nopredphations, such as that displayed \f\A tower, The levels of IE1 protein 
were normalized to tubulin levels from the Western blot *n A. The mutant and 
reverent viruses are normalised to WT levels for each time point. P values 
were 4 etermlned by the Student's ttest. (B Middle) Quantification of IE1 RNA 
relative to UL37 RNA byqRT-PCR. Mutant and repaired viruses are normalized 
to WT levels for each time point (tower) ratio IE1 protein (from 7op) to IE1 
RNA (from Middle). 



independent experiments, each analyzed by performing three in- 
dependent immunoprecipitations. At 6 and 24 hpi, we did not 
observe an effect attributable to miR-UL112-l activity, consistent 
with. the observation that the miRNA is not detected at 6 hpi and 
relatively little is present at 24 hpi. In contrast, at 48 hpi, when the 
miRNA has accumulated to higher level* the miR«ULll2-l- 
deficient and the IE1 target she-deficient mutants exhibited statis- 



tically significant increases (^-2-fold) in IE1 protein synthesis rel- 
ative to the wild-type and rcvcrlani viruses. 

At each time protein extracts were prepared, total RNA was 
isolated from a duplicate sample, and the amount or IE1 RNA was 
determined relative to the level of an independent IE RNA (U137) 
by quantitative RT-PCR. IE1 RNA levels varied little among the 
viruses (Fig. ZD Middle), arguing that the miRNA docs not signif- 
icantly alter the- stability of TEl mftNA and 'supporting the con- 
clusion that the changes in IEJ. protein levels result from the 
inhibition of translation. The ratio of XET protein to RNA was 
calculated (Fig. 3B Lower), confirming a significant increase in 
protein synthesis when either the miRNA or its target site is 
disrupted. 

Predictions of Targets for miRNAs Coded by Other Herpesviruses. 

Encouraged by the experimental verification of our prediction in 
HCMV, we applied the algorithm to an analysis of three additional 
human herpesviruses, HSV-1, EBV, and I<$HV each proved to 
encode miRNAs predicted to inhibit the expression of viral pro- 
teins, including IE proteins. Table 2 displays the rank of the 
retargeting miRNAs among all possible miRNA-3' UTR pairs 
(the total number is equal to the number of 3' UTRs times the 
number of miRNAs) . The rank is again based on the P value PVsn 
computed according to the local first order MM or the global third 
order MM. ICPO in HSV-1, BZLF1 and BRLF1 in EBV, and Zta 
and Rta in KSHV are among the virus-specific targets most likefy 
to be targeted by virus-coded miRNAs (top OS-2% ofvirus-specific 
targets). The BZIJFVBRLF1 3' UTR of EBV is predicted to be 
targeted by two miRNAs, 

SI Tables 3-6 Show three additional pieces of information for 
each virus: First, there is alist (Si TaWes M-tA) tor each miRNA 
of the total actual and predicted number of binding sites across all 
3 ' UTRs with associated P value, miRNAs with smaller P value are 
more likely to regulate some (unspecified) viral genes. The total 
number of functional binding sites for miRNAs can be estimated 
from the difference of the total numbers of actual and predicted 
seed binding sites (21). It is reassuring that this number is always 
positive, regardless ot the virus or the MM used. Second, there is 
a list (SI Tables 3B-6B) of the top 25 3' UTR targets, sorted 
according to the P value based on the total actuid and predicted 
binding-site counts across all miRNAs. 3' UTRs with small P value 
are likely to be regulated by some combination of viral miRNAs. 
Third, there is a list (SI Tables 3C-6Q of the top 25 miRNA-3' 
UTR pairs. Pairs with small lvalue are most likety to be functional 
pairs. The ranks of the IE genes in Table 2 are derived from this list. 

Besides the IE genes, the top predicted miRNA targets include 
many genes involved in viral DNA replication as well as several 
inhibitors of apoptosis and other genes involved in immune evasion. 
Brief descriptions of the predicted targets in these functional groups 
arc summarized in SI Table 7. 

Discussion 

Our computational analysis predicts thai miRNAs encoded by 
members of three herpesvirus subfamilies target viral IE genes. 
HSV-1 (^-herpesvirus) miR-LAT likely targets ICPO. HCMV 
(/3-herpesvirus) miR-UL112-l targets ESI. EBV (v-hcrpesvirus) 
miR-BHRJPl-3 and miR-BART15 likely target- BZLF1 and 
BRLFl, which share their 3' UTR (Table 1 and SI Table 4C). If the 
effect of all EBV miRNAs is combined, this 3' UTR is in fact the 
top target in the whole genome (SI Table 4B). Li addition, the Seed 
binding is actually longer than the heptamer 2-8 used in the 
analysis; it is an octamer 1-8 for miR-BHRFl-3, and if noocanoni- 
cal binding U~C is allowed, a 9-mer 210 for miR-BART15. KSHV 
( r herpesvirus) Rta (ORF 50) and Zta (ORF K8) are homologues 
of EBV BRLFl and BZLF1, respectively, although functionally 
BZLF1 seems 10 correspond more to the Rta. The 3' UTR shared 
by Zta and Rta '*$ 1 ikely targeted by the KSHV miR-KlO-6-3p, which 
is numerically the strongest prediction of this algorithm (99.93 
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Table 2. Whole genome ranks for predicted mi ANA-IE target pairs in four herpesviruses 



Virus 


3'UTR* 


Length 


miRNA 


Seed 


Count 


Rank A' 


Percentile 


Rank B 


HSV-1 


ICPO 




hsv1-miR-lAT 


3-& 


1 


4of 154 


97.40 


12 of 154 


E3V 


62LF1, BRLF1 


53 


ebv-mft-BARTIS 


2-* 


1 


3 of 2,720 


99.S9 


4 of 2,720 


EBV 


BZLF1, BRLF1 


53 


ebv-miR-BHRFl-3 


2-8 


i 


4 of 2,720 


99.85 


3 Of 2,720 


HCMV 


IE1 


92 


rvcmv-rniR-UL112-1 


2-3 


1 


10 of 4,896 


99.80 


9 of 4,896 


KSHV 


Zta, Rta 


1,144 


kshv-miR-K12-6-3p 


3-8 


4 


1 of 1,394 


99.93 


1 Of 1,394 



The t»ble reports the top miRNA-IE target pairs for hj\m, £BV, KSHV, and HCMV after sorting by PVy, score- 
*BZLP1 and BRLF1 as well as Zta and Rta give rise to 3' coterminal transcripts and therefore genes In each pair have the same 3'UTRs. 
T Rank A (resp. rank B> donotcs the rank among all possible miRNA-3'UTR pairs sorted by P values computed for the random sequence based on the first-order 
local (resp. tho third-order aJobaQ MM. Percentile correspond* to Rank A. 
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percentile), Finally, besides the IE genes, the anafysis (SI Tables 
3*6) suggests numerous other likely miRNA targets, eg,, the 
HSV-1 neurovirulencc factor y543, whose RNA contains an 
octamer seed binding site for nriR-LAT. 

Wc confLrmcd the predicted inhibition of HCMV DEI translation 
by tniR-UL112-l within transfectcd cells by using reporter con- 
structs (Fig. 1) and within virus-infect cd fibroblasts by analyzing 
mutant viruses (Figs. 2 and 3). As we were writing this manuscript, 
Cray et aL (31) reported the same conclusion. They demonstrated 
that addition Of miR-lJU12-1 RNA to cells before infection 
inhibited IE1 expression and viral DNA replication. 

Given the broad range of predicted targets (SI Tables 3-6), we 
anticipate that herpesvirus-coded miRNAs exert regulatory effects 
directly on viral gene expression during replication and spread 
within infected hosts. This regulation could have many conse- 
quences e.g., down-regulating viral genes as the infectious cycle 
progresses to avoid toxicity and helping to modulate viral gene 
expression to optimize replication in a variety of different cell types. 
Most importantly, however, our results also suggest that virus- 
coded miRNAs could play a central role in the establishment and 
maintenance of latency. Because they target IE products that act at 
the top of the lytic cascade, miRNAs expressed in cells destined for 
a latent infection can potentially antagonize the cascade and 
thereby favor entry into latency. Furthermore, miRNAs expressed 
during latency could help to prevent reactivation by iniuntting 
translation of IE rrarisaetrvators. 

There arc several indirect justifications of this model. First, 
although expression of HCMV-codcd miRNAs during latency has 
not yet been evaluated, HSV-1, EBV, and KSHV miRNAs are 
transcribed and available to function during latency. Second, if 
miRNAs arc indeed Central agents responsible for the maintenance 
of latency, this would explain the mysterious failure to detect 
proteins in the latent stage of HSV-1. Finally, our algorithm predicts 
that numerous early genes involved in DNA replication are targeted 
by viral miRNAs. In fact the EtfVmiR-BART2 cleaves the mRNA 
of the DNA polymerase BALF5 (32). Early genes are normally 
activated by IE products, and their activity in turn would be 
expected to importantly influence the choice between lytic repli- 
cation and latency. 

In sum, antagonism of TE genes, which form a control node at the 
top of the herpesvirus lytic cascade, could contribute intportantly to 
the establishment of latency and seems an optima] strategy for 
maintenance of latency. 

Materia b and Methods 

Computation. To determine the likelihood that a particular miRNA-3' UTR pair is 
functional, we computed the corresponding probability Pv*sh, Let C denote the 
actual count of seed rwners in the 3' UTR of length randp the probability (based 
on the MM described in Si Materia/* arte* Methods) that any given n-mer in the 
. random background sequence is the seed n-mer.Then ourPvoluo PVsh gives the 
probability of finding at least c seed n-mers in a background sequence of length 
/which is equal to tho P value Of the binomial distribution. 



in practice, / is of the order of too or 1,000. For a hexamer seed sequence (n - 6), 
a typical p is V4 K - 1/4,096 (exactly rf all hexamcrs wore equally likely) and 
-tho refer o a typical c Ss zero, making the equation above impractical. An alterna- 
tive exact expression for PVsh, which tc numerically efficient is 



pv s „ - PV rtD (/ - h + i 9 c,p) = 



B(p,c t I - n - c + 2) 
B{c f l~n -c + 2) * 



where fl(*.fl,b) is the incomplete beta function and 0fr,O) H the usual beta 
function, 

Jo 

The statistical significance of the top miRNA-target pairs was evaluated by 
calculating probability PvW Because the majority of P value PVsh »s equal to 
^ t we cannot use the standard method of estimating the false discovery rate 
(22). Instead, we use the following Monte Carlo procedure: First generate n - 
1,000 random genomes analogous to the actual genome of interest. This 
moans that each genome will have exactly the same number of 3' UTRs as the 
genome of interest and each generated 3' UTR will be of the same length as 
the corresponding real 3' UTR. Each random 3' UTR ts generated by using the 
Ath order MM based on the composition of the corresponding 3* UTR in the 
real genome. 

For each of the N randomly generated genomes, we fOpoatthesame analysis 
Of computing PV*, as we did for the real ponomo; i,0 v we compute the score PVsh 
for each miRNA-3' UTR and sort them. Now, evaluate the statistical significance 
of the top f miRNA-target pairs for the actual genome by counting the number 
W e of the randomly generated genomes in which the tth top microRNA-3' UTR 
pair has PVansmaKer than the tth pair in*tho*ctwl$<^omc. For each t compote 
the lvalue PV MM (tf corrected for multiple hypothesis testing by 

PV M h(')~~. 

PVWt) is the probability of finding better scores for The top t potential mi- 
• croRNA-3' UTR pain in a random genome Wrth similar properties as the actual 
genome. The smaller P\Wfl. the higher the chance that the predicted targets 
are reel targets- 
Additional details relevant to computation a re described in S! Materfok one* 
Methods, including sequences used in the analysis, choice of the seed sequence, 
construction of random background sequence, use of conservation between 
strains or species. 

Cells, Viruses, and- Plasmids. MRC5 and cells were propagated in 

medium with 10% FBS or 10% newborn carf serum, respectively. 

The wild-type virus used in these studies Is BFXwf-GFP. It is a derivative of 
a bacterial artificial chromosome (BAC) done of the HCMV VR1 814 clinical 
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isolate (33) In which * gre«n fluorescent protein (GFP) expression cassette has 
been inserted upstream of the US7 Qftf. Three derivatives of BfXivMaFP were 
produced by using ga!K selection and counter selection to modify BAC DMAs 
(34) *s described in 57 Materials andMathods. BFXoTlEI lacks the 7-nt seed 
sequence for miWl 2-1 within the IE1 3' UTR, BFXsubl 12-1" contains 12 single 
base pair Substitutions that block expression of miR-1 1 2-1, BFXsvM 12-1 r Is a 
repaired derivative of BFXJuibn2-1"\ virus was generated by electroporation 
pf MRCS cells with BAC DMA (20 fig) plus an MCMv* pp71 -expressing plasm id 
(pCGNpp7i). Virions were purified by centrifugation through a 20% sorbitol 
cushion. Virus titers were calculated by Infecting fibroblasts and counting 
162-positive fog at 24 hpi. 

The construction of plasmlds is described in SI Materials and Methods. 

mRN A and mlRNA Quantification. Real-time RT-PCR was performed en total RMA 
isolated from the cells by using the mirVana miRNA isolation kit. (Am bio n), which 
isolates total RNA while preserving the mlRNA population. DMA was removed by 
using the DNA-fnae rctagent kit CAmbfen). equal aliquots of total RNA were 
reverse transcribed by usMgthe Teqmen Reverse Transcription kit with random 
hexamors according to the manufacture's protocol (Applied Biosystems). To 
measure mRNA levels, roa Wmc PCR w« performed with SYBR green PCft master 
mix (Applied Biosystems) and primers specific to exon 4 of I £1 (S'-GCCTTCCCTAA- 
GACCACCAAT-3* and 5'-ATTTTCTGGGCATAAGCCATAAT03') or axon t of UL37 
(5'^AC<5AAGTCCGATGA<3<3A<SGATG^- and 5'-TGC5CACAaC{3<3C<3TTfjrTu> 
3'). 

To measure levels of miR-UL112-t, a modified TaqMan-based Stem loop RT- 
PC8 was pert ofrned 0$). TaqMan MkroRNA Reverse Transcription kit (Applied 
Biosystems) was used according to the manufacturers protocol with the follow- 
ing stem-loop Ollgonudoottdo: S'-GTC^TATCCAGTGCACGGTCCGAGGTAT- 
TCG CACTG G ATACG ACAG CCTG-3 ' . A 1:15 dilution of the product from the re- 
verse transcriptase reaction was used in a TaqMan quantitative PCR along with 
1.5 mM of forward primer C5'-GCCGCAAGTFACGGTGAGAT-3 \ 0,7 mM of re- 
verse primer (5'-GTGCAGGGTC(!GAGGT-3'). 0-2 mM of TaqMan probe (5'-6FAM- 
TACGACAG CCTGCAT-mgbMFQ) and 1 x Universal T*qM*n PCR Master mix (Ap- 
plied Biosystorrc). The results were normalised by quantifying the leveb of 



human UCB small nudear RNA using the RNU68 Teqmen control assay (Applied 

Biosystems). 

Protein Quantification. MRCS cells were infected at a multiplicity of 3 pfu per celK 
Cells were starved formethionine and cystine before labeling by incubating for 
1 h in medium with 1 0% dlaJyzed FBS. EasyTag Express Protein labeling mix (100 
fiCi: PerxinElmer) wosaddedtothocellsfor 1 h, aftefVmJch the labeling medium 
was replaced with medium containing 10% PCS for 10 min to allow stalled 
translation to complete- Cells were washed in PBS and then lysed in buffer 
containing 20 mM Tris acetate pH 7.5, 0.27 M Sucrose, 1 mM EDTA, 1 rnM 6GTA, 
1 mM sodium Ofthovanadatc, 10 mM sodium ^glycerophosphate, 50 mM so- 
dium i loridc, 5 mM sodium pyrophosphate, and 1 % Triton X-1 00. One tablet of 
Compfoto Mini Protease inhibitor (Roche Applied Science) was added per to ml 
of lysis buffer. Protein concentration was calculated by Bradford assay. 

Aliquots (10 vg) were subjected to Western blot assay using monoclonal 
antibodies specific for hcmv |£l (1B12), HCMV u<4fe (i0B4) and monoclonal 
anti-tubulin antibody (Sigma-Aldrkh). An anti-mo use KRP conjugated antibody 
was used along with the ECL plus detection kit (Amersham) to detect specific 
bands. Chemilumimiseonee was analysed by using a Phosphorimagerand image- 
Ouarrt TL software (GE Healthcare Life Sciences. Piscataway. NJ). 

For immunoprocipitation assays, aliquots of lysate {5 or 1 0 j*g protein) were 
predeered with Protein A/G Plus Agarose beads (Santa Cruz Biotechnology) for 
4 h at 4"C ArrtNEl monodonal antibody (1B12) and Protein A/G Plus Agarose 
were added to the supernatant which was incubated overnight at 4*C with 
shaking. Immunopretipitated complexes were washed three times with RIPA 
buffer (50 mM Tris-HCl pH 7.4. 1 % Nonidet FM0. 0,25% No-rJecwychc4ate, 1 50 mM 
NaO, 1 mM EDTA) supplemented with Cbmploto Mini Protease inhibitor. Beads 
were boiled Jn 2x SOS loading buffer and run on an 8% SOS/PAGE gel to separate 
the immunopreciprtarted complexes. Gels were dried and exposed to a phosphor 
screen, which was analyzed by using a Phosphorimagor and imageQuont TL 
software. 
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